Picture for Ziyang Wang

Ziyang Wang

RoboSurg-VQA: A Multimodal Benchmark for Surgical Segmentation-Aware Visual Question Answering

Add code
May 21, 2026
Viaarxiv icon

EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding

Add code
May 11, 2026
Viaarxiv icon

KAN Text to Vision? The Exploration of Kolmogorov-Arnold Networks for Multi-Scale Sequence-Based Pose Animation from Sign Language Notation

Add code
May 10, 2026
Viaarxiv icon

GRASPrune: Global Gating for Budgeted Structured Pruning of Large Language Models

Add code
Apr 21, 2026
Viaarxiv icon

Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA

Add code
Apr 15, 2026
Viaarxiv icon

HICT: High-precision 3D CBCT reconstruction from a single X-ray

Add code
Apr 01, 2026
Viaarxiv icon

TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation

Add code
Mar 24, 2026
Viaarxiv icon

MeInTime: Bridging Age Gap in Identity-Preserving Face Restoration

Add code
Mar 19, 2026
Viaarxiv icon

Multimodal Fact-Level Attribution for Verifiable Reasoning

Add code
Feb 12, 2026
Viaarxiv icon

Agent Mars: Multi-Agent Simulation for Multi-Planetary Life Exploration and Settlement

Add code
Feb 09, 2026
Viaarxiv icon